Multiresolution channel normalization for ASR in reverberant environments
نویسندگان
چکیده
To overcome the problems related with the long impulse responses produced by reverberation, we use a long time window (high frequency resolution) analysis during the channel normalization steps of the feature extraction process in automatic speech recognition (ASR). After nor-malization, a trade between frequency and time resolution is used to increase the rate at which the time information is sampled (short-time domain), yielding an appropriate domain to derive ASR features. Experiments on data with reverberation times of about 0:5 s show that the new technique achieves signiicant performance improvement of a speech recognizer under reverberation, with only some performance degradation on clean speech.
منابع مشابه
Carlos Avendano , Sangita Tibrewala and Hynek Hermansky , " Multiresolution Channel Normalization for ASR in Reverberant
To overcome the problems related with the long impulse responses produced by reverberation, we use a long time window (high frequency resolution) analysis during the channel normalization steps of the feature extraction process in automatic speech recognition (ASR). After nor-malization, a trade between frequency and time resolution is used to increase the rate at which the time information is ...
متن کاملOn the Use of Artificial Reverberation for Asr in Highly Reverberant Environments
In this paper, we discuss the use of artificial room reverberation methods to increase the performance of automatic speech recognition (ASR) systems in highly reverberant enclosures. Our approach consists in training acoustic models on artificially reverberated speech material. In order to obtain the desired reverberated speech training database, we propose to use a reverberating filter whose i...
متن کاملA corpus-based approach for robust ASR in reverberant environments
In this paper, we discuss the use of artificial room reverberation to increase the performance of automatic speech recognition (ASR) systems in reverberant enclosures. Our approach consists in training acoustic models on artificially reverberated speech material. In order to obtain the desired reverberated speech training database, we propose to use a reverberating filter whose impulse response...
متن کاملModel-based blind estimation of reverberation time: application to robust ASR in reverberant environments
This paper presents a method for blind estimation of reverberation times in reverberant enclosures. The proposed algorithm is based on a statistical model of short-term log-energy sequences for echo-free speech. Given a speech utterance recorded in a reverberant room, it computes a Maximum Likelihood estimate of the room full-band reverberation time. The estimation method is shown to require li...
متن کاملEffectiveness of dereverberation, feature transformation, discriminative training methods, and system combination approach for various reverberant environments
The recently released REverberant Voice Enhancement and Recognition Benchmark (REVERB) challenge includes a reverberant automatic speech recognition (ASR) task. This paper describes our proposed system based on multi-channel speech enhancement preprocessing and state-of-the-art ASR techniques. For preprocessing, we propose a single-channel dereverberation method with reverberation time estimati...
متن کامل